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Abstract 

We propose a dynamical process for network evolution, aiming at 
explaining the emergence of the small world phenomenon, i.e., the sta- 
tistical observation that any pair of individuals are linked by a short 
chain of acquaintances computable by a simple decentralized routing 
algorithm, known as greedy routing. Previously proposed dynamical 
processes enabled to demonstrate experimentally (by simulations) that 
the small world phenomenon can emerge from local dynamics. How- 
ever, the analysis of greedy routing using the probability distributions 
arising from these dynamics is quite complex because of mutual de- 
pendencies. In contrast, our process enables complete formal analysis. 
It is based on the combination of two simple processes: a random walk 
process, and an harmonic forgetting process. Both processes reflect 
natural behaviors of the individuals, viewed as nodes in the network 
of inter-individual acquaintances. We prove that, in fc-dimensional lat- 
tices, the combination of these two processes generates long-range links 
mutually independently distributed as a fc-harmonic distribution. We 
analyze the performances of greedy routing at the stationary regime 
of our process, and prove that the expected number of steps for rout- 
ing from any source to any target in any multidimensional lattice is 
a polylogarithmic function of the distance between the two nodes in 
the lattice. Up to our knowledge, these results are the first formal 
proof that navigability in small worlds can emerge from a dynamical 
process for network evolution. Our dynamical process can find prac- 
tical applications to the design of spatial gossip and resource location 
protocols. 
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1 Introduction 



Models relating geography and social-network friendship enable a good un- 
derstanding of the small world phenomenon, a.k.a., six degrees of separation 
between individuals [HI [29]. In these models, the probability of befriending 
a particular person is assumed to be inversely proportional to the number 
of closer people, fitting with what was observed experimentally (cf. |28j). 
Under this assumption, it was proved that, using ad hoc probability distri- 
butions, many classes of graphs are navigable, that is, a simple decentralized 
routing procedure enables efficient routing from any source to any target. 
(By efficient, we mean, as it is standard in this framework, that routing 
from any source s to any target t takes a poly logarithmic expected number 
of steps). For instance, such a navigability property is satisfied in multi- 
dimensional meshes |24| . in graphs of bounded ball growth [13], and more 
generally in graphs of bounded doubling dimension [34] . In all these cases, 
a graph G, that may not only represent geography but also other proximity 
measures like professional activities, religious beliefs, etc., is enhanced with 
additional links chosen at random. More precisely, every node is given some 
long-range links pointing at other nodes in the graph. For each long-range 
link added at a node u, the probability that the head of this link is v is 
inversely proportional to the size of the ball of radius distort, v) centered at 
u in G, hence depending on the density of G around u. This setting applies 
to weighted graphs too [26], and to infinite graphs as well [13]. For instance, 
in the /c-dimensional lattice the probability that u has a long-range link 
pointing at v is essentially proportional to l/d k where d is the distance be- 
tween u and v in the lattice. This setting of the long-range links enables 
greedy routing to perform in polylogarithmic expected number of steps (as 
a function of the distance in the lattice between the source and the target). 

1.1 Navigability as an emerging property 

In [25] (Problem 7), Jon Kleinberg asks about "what kinds of growth pro- 
cesses or selective pressures might exist to cause networks to become more 
efficiently searchable". Many attempts have been made to explain how the 
density-based distribution of the long-range links can emerge with time from 
the evolution of a network. Inspired by the world wide web or by P2P file- 
sharing systems, all the models we are aware of have considered the augmen- 
tation process (or rewiring) of a static graph used by its nodes for searching 
information. Our work uses a different approach, starting from the following 

1 Greedy routing [24] aims at modeling the routing strategy performed by the indi- 
viduals in Milgram experiment. In a graph G enhanced with long-range links, a node u 
handling a message of destination t selects among all its neighbors, including its long-range 
contact (s), the one that is the closest to the target t according to the distance in the base 
graph G, and forwards the message to that node. 
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observations. One the one hand, anyone of us can call or email any person 
in the world. On the other hand, to do so, it is frequently the case that 
we have met this person before. We thus start from the assumption that 
long-range connections are between remote people who have met once in 
the past. In other words, long-range links are emerging from nodes mobil- 
ity, that we model by random walks in this paper. Another observation is 
that people forget some of their former acquaintances along with time. This 
forgetting mechanism represents the well understood fact that one cannot 
maintain close relationships with an explosive number of people. Thus we 
couple the random walk process with a forgetting process, and prove that 
this idealistic setting is sufficient to insure polylogarithmic navigability with 
simply one long-range connection per node. 

1.2 Rewiring processes 

Clauset and Moore [9] proposed the following rewiring process for the mul- 
tidimensional lattice, inspired by the actions of surfers on the web. While 
routing from a source s to a target t, if the target is not reached after r steps, 
then the long-range link of s is rewired to point at the current node x. The 
threshold r is set based on the distance (in the lattice) between s and t, and 
on the expected time of greedy routing from s to t when the /j-dimensional 
lattice is augmented using the /c-harmonic distribution [Mj • The simulation 
results presented in [S] show that the distribution / of the link lengths con- 
verges to the power law h(d) oc l/d k . Sandberg and Clarke [32] proposed a 
different rewiring process, based on Freenet feedback mechanisms [S]. This 
iterative process selects, at each phase, two nodes s and t uniformly at ran- 
dom, and constructs the greedy path s = xq, xi, ■ ■ ■ , zrfc-i, %k = t from s to t. 
For every i £ {0, 1, . . . , k}, the long-range link of X{ is rewired with probabil- 
ity p, to point at t. The k + 1 decisions (rewiring or not) are taken mutually 
independently. This process is analyzed in [31]. It is proved that, under 
some hypotheses, the process converges. Moreover, the stationary distribu- 
tion / of the link lengths can be fully characterized. In the fc-dimensional 
lattice, it is close to the power law h(d) oc l/d k for an appropriate p, and 
simulations show that greedy routing in rings and meshes enhanced using 
the stationary distribution / performs as efficiently as when these networks 
are enhanced using the 1- and 2-harmonic distributions, respectively. 

For both [9] and [32], the complete formal analysis of the process remains 
open (even the formal characterization of the stationary distribution of the 
processes described in [9] remains open). The difficulty of the analysis is 
due to the dependencies between the long-range links generated by the pro- 
cesses. In particular, the computation of the greedy routing performances 
is a challenge when the long-rank links are not mutually independent. So, 
building further theory upon these two models looks quite difficult. 

In this paper, we propose a dynamical network model based on the com- 



2 



bination of two simple processes: a random walk process, and a harmonic 
forgetting process. We prove that the combination of these two processes gen- 
erates long-range links mutually independently distributed as a distribution 
that resembles the density-based distribution, and from which navigability 
provably emerges. 

1.3 Sketch of our network evolution process 

In our network evolution process, called move-and-forget, or M&F for short, 
individuals are modeled by tokens moving from node to node in the k- 
dimensional lattice Z fc , for some fixed integer k > 1 (the dimension of the 
lattice may be related to the number of proximity criteria used by the indi- 
viduals for routing). Initially, each node is occupied by exactly one token. 
These tokens are moved mutually independently during the execution of the 
dynamical process, according to a random walk. 

Tokens are attached to the heads of the long-range links, whose tails 
are the nodes from where the tokens initially started their random walks. 
Using the analogy of individuals moving in the geographical world, each long- 
range link indicates an acquaintance between an individual located at a fixed 
geographical point (where the token initially stood) and some individual 
located at some geographical coordinates (where the token currently stands) . 

The random walk process is coupled with another dynamic: nodes may 
forget their contacts through their long-range links. The motivation for our 
forgetting process is that individuals may loose contact with former good 
friends, but they meet new people among which some may become close 
friends. Since older acquaintances indicate stronger relationships, we assume 
that they have less probability to be forgotten than recent ones. More 
precisely, a long-range link of age a, that is a long-range link that survived 
a steps of the forgetting process, is forgotten with probability <p{a) oc 1/a. 
When a long-rang link is forgotten by a node, it is rewired to point at this 
node (hence creating a self-loop). The token at the head of the forgotten 
link is removed, and a new token is launched at the node. (A new local 
relationship replaces an old remote relationship). 

Note that M&F is defined independently from the dimension k of the lat- 
tice: tokens execute random walks, and they are forgotten with a probability 
that depends only of their ages. 

1.4 Our results 

We prove that, for any fixed integer k > 1, the m&f rewiring process 
sketched above converges in the /c-dimensional lattice to a distribution / 
of the link lengths that resembles the fc-harmonic distribution. Precisely, 
we prove that there exists do > and two positive constants c and c', such 
that, for any d = (d\, . . . , dk) G Z fc with \di\ > do for alH € {1, . . . , k}, we 
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Table 1: Properties of known network evolution processes compared to M&F 
have 



d || fe -ln 1+e II d 



< /(d) < 



ln fc / 2 || d 



•In 



l+e 



d 



where e > is a fixed (arbitrary small) parameter of m&f, and || • || denotes 
the £oo norm. 

Moreover, M&F guarantees the mutual independence of the long-range 
links. As a consequence, the performances of greedy routing in the lattice 
enhanced using the distribution / can be analyzed formally. We prove that 
the expected number of steps of greedy routing from any source s to any 
target t at distance d in the fc-dimensional lattice satisfies 

E[X s>t ] <0(ln 2+e d). 

Therefore, greedy routing performs polylogarithmically as a function of the 
distance between the source and the target. In particular, the performances 
of greedy routing are essentially the same as the ones obtained by Klein- 
berg [23] using the ad hoc fc-harmonic distribution [24] . 

Up to our knowledge, these results are the first formal proof that nav- 
igability in small worlds can emerge from a dynamical process for network 
evolution (see Table [T]). Moreover, m&f is simple (by just coupling two 
simple dynamics), naturally distributed (each node takes care of just its to- 
ken), robust (the loss of one token simply requires to launch a new token), 
and scalable (by direct adaptations of the infinite lattice setting to square 
toroidal meshes of arbitrary sizes). 

Last but not least, m&f can find practical applications, including the 
design of distributed spatial gossip and resource location protocols. 



1.5 Related works 

The search for a network evolution process that could explain the emergence 
of the small world phenomenon in social networks started with the pioneer- 
ing work of Watts and Strogatz [35] who proposed a rewiring process in the 
cycle, generating networks possessing small diameter and large clustering 
coefficient, simultaneously. Adding random matchings to cycles, as in [5j, 
yields graphs with small diameter, but non necessarily with small clustering 
coefficient. As far as navigability is concerned, these networks do not sup- 
port efficient decentralized routing mechanisms [21]. Albert and Barabasi [2] 
produced a thorough investigation of the preferential attachment model [33] . 
Although the preferential attachment model enables the design of efficient 
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search procedures under specific circumstances (see [16] and the references 
therein), the recent lower bounds in [12] show that polylogarithmic routing 
cannot be achieved in general in networks generated according to this model. 
Recently, Liben-Nowell and Kleinberg [27] tried to infer which interactions 
in social networks are likely to occur in the near future from the observation 
of the existing ones, but navigability is not of their concern. Actually, as 
far as we know, the only network evolution models from which polylogarith- 
mic routing emerges are the aforementioned ones (9[ [32] , which we already 
discussed. 

Following up the seminal work of Kleinberg [23], a large literature has 
been dedicated to the analysis of greedy routing in graphs enhanced by long- 
range links set according to various kinds of probability distributions (see, 
e.g., [II [131 O (13 S3])- These papers proved that several large classes of 
graphs can be enhanced by long-range links so that greedy routing performs 
in polylogarithmic expected number of steps. A lower bound of i}(n 1 ^ l ° sn ) 
expected number of steps for greedy routing in arbitrary graphs has been 
proved in [18], and an upper bound of 0(n 1 ^) has been proved in [T7] . 
Lower bounds for the cycle can be found in 13[ [TU]. 



2 The Move-and-Forget (m&f) Rewiring Process 

2.1 Process description 
2.1.1 Random walks 

Let k > 1 be an integer. The rewiring process move-and-forget (m&f for 
short) assumes that each node in the /c-dimensional lattice Z fc is initially 
occupied by exactly one token. These tokens move mutually independently 
according to random walks. That is, each token is given a set of k fair 
coins Cj, i = 1, . . . , k. At each step of its walk, each token flips its k coins, 
and moves in the ith dimension of the lattice in the positive direction if 
Cj is head, and in the negative direction if it is tail. More precisely, let 
X(t) G Z fc denotes the position of a token in the lattice after t steps of m&f, 
assuming that the token initially started at node (u\, . . . , u^) S Z fc . We have 
X(0) = (ui, . . . , Uk), and, for t > 1, X{t) = (Xi(t), . . . , Xk(t)) satisfies 

X(t) = ! X i (*-!) + 1 with Probability 1/2; 
i( ' \ Xi(t - 1) - 1 with probability 1/2. U 



2.1.2 Setting of the long-range links 

Tokens are attached to the heads of the long-range links, whose tails are 
the nodes from where the tokens initially started their random walks (see 
Figure (Ha)). The head of a long-range link is called the long-range contact 
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Figure 1: Dynamic of the long-range links in M&F. 



of the tail of this link. Hence the long-range contact of a node u is the node 
v currently occupied by the token launched by node u. 

2.1.3 Forgetting process 

Nodes may forget their contacts through their long-range links. More pre- 
cisely, a long-range link of age a > 0, that is a long-range link that survived 
a steps of the forgetting process, is forgotten with probability 4>{a). When 
a long-range link is forgotten by a node, it is rewired to point at this node 
(see Figure [T^b)). The token at the head of the forgotten link is removed, 
and a new token is launched at the node. This new token starts another 
random walk in Hence, if A(t) G N denotes the age of the long-range 
link of some node u, that is the number of steps between time t and the last 
time this link was rewired during the execution of M&F, and if C(t) denotes 
the long-range contact of node u at step t, then we have C(t) = X(A(t)). 

The forgetting function <f) has a huge impact on the distribution of the 
long-range link lengths. In this paper, we will consider <f>(a) oc 1/a. The 
precise setting of (ft will appear more complex for technical reasons onljl 
(series convergence for infinite lattices, normalization, etc.). In fact, its 
behavior essentially reflects a decreasing of the forgetting probability that 
is inversely proportional to the age of the relationships. The precise setting 
of (f> is described in the next section which explains the connections between 
the random walk X, the forgetting function <p, and the distribution / of the 
long-range link lengths. 

2.2 Setting of the forgetting function 

We first prove that the age of the long-range link resulting from the execution 
of M&F at a node has a stationary distribution (the proof of this lemma can 

2 For instance, one needs X^ a >o ^( a ) t° diverge since otherwise the Markov chain A(t) 
would be transient, and links could survive infinitely with positive probability. However, 
on the one hand, just setting <f>(a) = 1/a would make A(t) recurrent null (and thus for 
any a we would have Pr{A(t) = a} converging to as t goes to infinity), but, on the other 
hand, setting 4>{ a ) = l/a a with a < 1 would not yield navigability. 
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be found in Appendix lAl) . 

Lemma 1 For any function (f> in [0, 1) such that the series of general term 
II^ =1 (1 — <j>(i)) is finite, (A(t))t>o * s a Markov chain which is irreducible, 
aperiodic, and positive recurrent, with stationary probability distribution n 
where 

7r(a) = : , 

for all a > 0. 

Definition 1 We define the forgetting probability <p as the following func- 
tion: 

[ if a = 0,1, or 2; 



In a 

where e > is arbitrary small. 

Note that 0(a) = ± + o (A). Indeed, 

ln(a-l)\i+e / ln(l- l/o)\l+e 1 + e ( \ 
' _ H ; =1 ; ho 



In a / V In a / a In a Vain a 

If (f) is denned according to Eq. ([2]), then Lemma [1] enables to give a close 
formula for n (the proof of this lemma can be found in Appendix [Bj . 

Lemma 2 If (p is defined according to Eq. then there exists a constant 
c > such that 7r(0) = vr(l) = vr(2) = c and for any a > 3, 

7T(o) 



a ln 1+e a 



Finally, the relationship between the stationary distribution of the long- 
range link ages and the stationary distribution of the long-range link lengths 
is made explicit in the following lemma (see proof in Appendix ICl). 

Lemma 3 The distribution of the long-range links converges to the distri- 
bution f satisfying, for any d 6 1i k , 



/(d) = 5>(a)-Pr{X(a)=d}. 



a>0 



3 Analysis of the dynamical process M&F 

In this section, we analyze the stationary distribution of the long-range 
link lengths in the fe-dimensional lattice, and prove that this distribution 
resembles the fc-harmonic distribution. 
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Theorem 1 There exist do > and two positive constants c and c' such 
that, for any d = (d\, . . . , d^) 6 Z fc with \d%\ > do for all i 6 {1, . . . , k}, we 
have 

c d ln fc / 2 || d || 

|| dp -ln 1+e ||d|| " /(d) - || d f -ln 1+e ||d|| 
where e > is the fixed parameter o/m&f, and || • || denotes the norm. 

To prove the theorem, we first prove that, for large distances d, a random 
walk of age a cannot be of length d unless a > S7(d 2 ). More precisely, we 
establish an exponentially small upper bound for the probability for a long- 
range link to be of length d at age a = o{d 2 ). Second, we prove that if the 
age a is sufficiently large, then the chance for a random walk to reach a given 
distance d at age a is proportional to Summing this probability over 

all values of a larger than d 2 allows us to conclude that the transform of 
the age distribution tt described in Lemma [3] is approaching the /c-harmonic 
distribution. 

Let us establish some basic properties satisfied by random walks in di- 
mension 1. We will extensively use the following Chernoff bound. Let T be 
a sum of Bernouilli variables, with expectation [i. Then [30J: 

t 2 

Pr{|T-^| > t} < 2 exp(- — ) for any t < [i . (3) 

4/i 

The following lemma specifies what must be the minimum order of mag- 
nitude for a in order to contribute significantly to the sum defining / in 
Lemma El 



distance d^TL, we have Pr {X(a) = d} < 2 ■ exp ^ 



Lemma 4 Let X be a random walk in Z. Then, for any age a > and any 

32o / 

Due to lack of space, the proof of the lemma is omitted (it can be found 
in Appendix |Pj) . 

We now compute an estimation of Pr {X(a) = d} when a is sufficiently 
large. We will use the following asymptotic equivalent of the binomial co- 
efficient, that can be derived by application of the Stirling formula. Let rej 
and mi be two sequences of positive integers such that rii — > oo, m« — > oo, 
and rij — m; — > oo when i grows to infinity. Then 



Tli\ I / ~ »i' _ 



"V \/2tt \j rrn ■ (ni - rrn) ■ (m - mi) n ^ m * ' 

Lemma 5 Let X be a random walk in Z. For any £ > 0, there exists do > 1 
such that, for any \d\ > do and a > ^.^u i ; we have 



( i -«'V^ exp (-^) £pr{x(<!) = <i)s(i+c) V^ exp (-^ 
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Due to lack of space, the proof of the lemma is omitted (it can be found in 
Appendix [E]). We are now ready to prove of the lower bound of Theorem [TJ 
For the sake of simplicity, let us first assume that the dimension of the 
lattice is 1. In this case, one can apply the results from the previous section 
directly. For any a > |d 2 we have 

( 3d 2 \ 
6XP \-^)- l,e - 

Therefore, for any ( > 0, there exists do large enough and a > |d 2 such that 
Lemma [5] yields: 

Pr{X(a) = d}> IzlJl^. 

e V vr sja 

Thus : 

m = 2>W«0 = -W.) > ^f- £ „.,/,. ,!,■+.,„) ■ 

More generally, in the /c-dimensional lattice, let us denote the position of the 
random walk by X{a) = (Ai(a), • • • , X^^a)). From the setting of m&f, each 
Xi is an unbiased random walk in dimension 1, and the XiS are mutually 
independent. We can thus apply all the results from the previous section 
independently for each coordinate of d = (di, . . . , dfc). Assuming that 

\d%\ > do for all i G {1, . . . , k}, 
we can apply Lemma [5] to every dimension. We get: 

3 n j n2 . w, _ r-i r,_r^/_A ji^i-C [21 _( 3d, 2 



a>-||d|| 2 ^ Vi6{l,...,fc},Pr{X i (o) = d < }> -\ ■= exp 

4 e V vr yja \ 4a 

For a > I || d || 2 , we have, 



3d? 3 d 2 „ , , / 3d, 2 , 

— L < U !!_ < l and thus exp *- > 1 e. 

4 a 4a \ 4a 1 



e V vr x/a 



As a consequence, by Lemma El 

Pr{X(a) =d} = PT{X 1 (a) = dt, . . . ,X k (a) = d h } > 
hence 

/<d) = £ft { *(.) = d>.*(.)>(l^/lY £ ^t^m - 

a>0 V / a>||jd|| 2 V 7 

The lower bound is then a direct consequence of the following result with 
jy _ 3 ll d l ^]j e proof of this lemma can be found in Appendix [FJ . 
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Lemma 6 For any e > 0, and any N > e 2 ^ 1+e \ we have 

2/(fc + l) < ^ 1 < 2/fc 

ln i+e AT - l+(fc/2) ln 1+e a ~ (A - l) fc / 2 ln 1+e (iV 

Due to lack of space, the proof of the upper bound of Theorem [1] is 
omitted (it can be found in Appendix iGl). 

4 Applications 

In the previous section, we have shown that the distribution / of the long- 
range link lengths is provably converging to a distribution that resembles the 
A;-harmonic distribution. In this section, we show that greedy routing can 
be formally analyzed at the stationary state of this distribution. Greedy 
routing can be formally analyzed for two reasons: (1) The distribution / 
of the long-range links constructed by M&F can be bounded formally (cf. 
Theorem [TJ; (2) The long-range links resulting from M&F are mutually in- 
dependent. Based on these two facts, we can establish the theorem below 
(see proof in Appendix |H]) . 

Theorem 2 In the k- dimensional lattice augmented with the long-range 
links at the stationary distribution of the dynamical process m&f, the ex- 
pected number of steps of greedy routing from any source node s to any 
target node t at distance d is 0(ln 2+e d). 

In the rest of the section, we discuss how M&F can find practical appli- 
cations to the design of spatial gossip and resource location protocols. 

Gossip-based protocols, a.k.a., epidemic algorithms [TO], have been in- 
troduced as a methodology for designing robust and scalable communication 
schemes in distributed systems. Roughly, in each step, each node u chooses 
some other node v, and sends a message to it. By applying such scheme at 
each node, an information originated at some source s will eventually reach 
its target (s). This methodology can be adapted to various problems, in- 
cluding information spreading, resource location, etc. In [22], Kempe et el. 
introduced spatial gossip, which allowed them to derive efficient solutions 
for many communication problems. In spatial gossip, nodes are arranged 
with uniform density in the A:-dimensional Euclidean space, and, at each 
step of the gossip protocol, node u chooses node v with probability oc l/d ek 
where g > is a fixed parameter, and d is the distance between u and v. In 
particular, it is shown that, for g E (1,2), spatial gossip enables to propa- 
gate information at distance d in time polylogarithmic in d. In [23], Kempe 
and Kleinberg showed that spatial gossip enables to solve larger classes of 
problems, including MST construction and permutation routing. In partic- 
ular, they prove that permutation routing using spatial gossip with g = 1 
performs in polylogarithmic expected number of steps. 




10 



We sketch how the M&F process could facilitate the implementation of 
the protocols in \22\ [23] for networks that take advantage of node mobility, 
as in, e.g., 0[2Ol[2l]. For instance, assume a network composed of a set X of 
fixed nodes and a set Y of moving nodes. Every x £ X connects to the, say, 
kx closer neighbors in X, and to the, say, ky nodes y £ Y that are currently 
the closest to x. Node x keeps all these y's as temporary neighbors, and 
it regularly checks whether these connections must be preserved. For that 
purpose, node x regularly flips biased coins (one for each neighbor y), and 
decides whether it should keep a neighbor or not according to the result of 
this trial. (The bias of the coin is a function <j> of the age of the connection). 
If x decides to forget some y, then x simply replaces y by the node y' £ Y 
that is currently the closest to x. An so on. Assuming that the moving 
nodes perform random walks, and that all nodes are arranged with uniform 
density in the /c-dimensional Euclidean space, Theorem [T] insures that the 
distances between a node and its moving neighbors are roughly distributed 
according to a /c-harmonic distribution. Hence, every fixed node can mimic 
spatial gossip for g = 1 by choosing u.a.r. one if its moving neighbors at 
each step. 

Measuring the impact of the parameters kx and ky on the performances 
of spatial gossip for g = 1, as well as setting up a forgetting function </> 
enabling to implement spatial gossip protocols for g ^ 1 are beyond the 
scope of this paper, but are currently under our investigation. 
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APPENDIX 



A Proof of Lemma [T] 

For all j > 0, we have: 

Pr{A(t + 1) = j | A(t) = i} = <i 



f l-<f>(j) if i = i + 1 
0(i + 1) if j = 
otherwise. 



The Markov chain (A(t))t>o is irreducible because any state i > can be 
reached from any state j > 0. Also, A is clearly aperiodic. Let us define the 
function ir as follows. For any a > 0, 

n tl d-^)) 



with the convention that the product ir? =1 (l — 4>(i)) equals 1. The function 
7T is well defined for all a > by hypothesis on <f>. Clearly, X^a>o 7r ( a ) = 1- 
We now check that it is a stationary distribution. For all a > 0, we have 

^ ?r(i) Pr{A(t + 1) = a \ A(t) = i} = vr(a - 1) • (1 - <f>(a - 1)) = 7r(o), 

i>0 

and 

2>(i)Pr{A(m) = 0|4(t) = i} = 7r(O)0(l)+7r(O)^0(i+l)nj =1 (l-0(j)). 

i>0 i>l 

Let J3(») = nj- =1 (l -4>(j)) for i > 0. We have 1 -<f>(i+ 1) = B(t + 1)/B(i), 
therefore <f)(i + l)-B(i) = B(i) — B(i + 1). Hence we get 

5>(i)Pr{A(t + l) = 0|A(t)=i} = 7r(O)(0(l)+X;(S(i)--B(i + l))) 

i>0 «>1 

= tt(O)(0(1)-B(1)) 
= vr(O). 

Therefore, 7r is a stationary distribution for A, and, since A is irreducible 
and aperiodic, it is unique. Therefore, A is recurrent positive (see Theorem 
3.1, p. 104 in [6]). 

B Proof of Lemma [2] 

Let B(j) = U{ =1 (1 - (f)(i)). We have B(j) = 1 for j = 0, 1, 2, and 



B(j) 



21n 1+e 2 



jln 1+e j 
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for j > 3. Therefore, the series of general term B{j) is finite since e > 0, 
and 4> satisfies the conditions of Lemma [TJ Precisely, we have 




2 In 2 



e 



< CO. 



C Proof of Lemma [3] 



For any time t > and any d, we have: 



Pr{C(t) = d} = Pv{X(A(t)) 



d} 

d and A(t) 



= J>r{X(a) 



a} 



a>0 



= ^Pr{X(a) 



d}Pr{^(t) 



a}, 



a>0 



since the Markov chain A is independent of the position of the token. More- 
over, since A is recurrent positive (Lemma H]), A(t) converges in variation to 
7r when t grows to infinity, that is: Yl a >o I Pr{j4(t) = a} — ir(a)\ tends to as 
t grows to infinity (cf. Theorem 2.1, p. 130 in [6]). Therefore, Pr{A(t) = a} 
can be replaced by vr(a) in the above equality when t grows to infinity. Fi- 
nally Pr{C(t) = d} is independent of t and its stationary distribution is 



D Proof of Lemma [4] 

First, note that the result is straightforward if a < |d| since the random 
walk cannot be at distance d in less than |g?| time steps. Thus we can 
assume a > \d\ in the rest of the proof. Similarly, we can assume d ^ since 
the lemma trivially holds for d = 0. Let {Yi,i > 1} be a collection of i.i.d. 
Bernouilli variables that take value 1 with probability 1/2. Let T be defined 



Thus we get that X(a) and 2T(a) — a have the same distribution. Now, 
E[X(a)] = for any a > 0. Thus, for any d ^ 0, 



The random variable T(a) is the sum of a Bernouilli variables with expec- 
tation 1/2. Thus it has expectation a/2, and since \d\/4 is less than this 
expectation, the Chernoff bound of Eq. ([3]) implies the result. 



/(d)- 



by 



T(a) = Y l + • • • + Y a . 



(6) 



Pr{X(a)=d} < Pr{|X(a)-E[X(a)]| > \d\/2} 
< Pr{|T(o)-E[T(o)]|>|d|/4}. 
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E Proof of Lemma [5] 

Assume, w.l.o.g., that d > 0. Fix £ > 0. According to the definition of a 
random walk in Z we have 

Pr{X(a) = d} = — ( . a jWo V 
1 K ' J 2 a \{a + d)/2) 

Let us rewrite (a + d)/2 = (a/2) • (1 + p) and a - (a + d)/2 = (a/2) ■ (1 - p), 
where p = d/a. According to Eq. (|3|) we get that, for any £' > with C' < C; 
there exists do large enough such that for \d\ > do and a > 64 ^ n y , we have: 

Pr{X(a) = d}< A /^4=2= f 7 , . * w^V- (7) 

1 W ' " V 7T • a ^(l - p 2) V (1 + p )(l+p) . (1 _ p )(i-p) J ^ 

On the other hand, for any x £ (—1,1) we have 

((1 • (1 -x)^)- 1 = exp (-(1 + x) ln(l + x) - (1 - x) ln(l - x)) . 

As x approaches zero we have (1 + x) ln(l + x) = x + %■ + o(a; 2 ) , and thus 

(1 + x) ln(l + x) + (l-x) ln(l - x) = x 2 + o(x 2 ). 

This latter expression can be rewritten as: for any v > there exists r/ > 
such that: 

|x| < 77 => exp (-(1 + z,)x 2 ) < ^— < exp (-(1 - u)x 2 ) . 

Since p = ^ < 64 ^ nrf becomes arbitrarily close to zero for large values of d, 
one can chose do large enough so that Eq. ([7]) holds if one replaces the value 
inside the bracket by the above upper bound, with v = 1/2. Hence we get 
that, for d > do and a > 6 f, . , 



Pr {X(a) =d}<(l + C) yj -^--j2== exp (-p 2 a/A) . 

Once again, since p is arbitrarily close to zero for large d, we can choose do 
large enough so that (1 + £')/\A — P 2 < 1 + C- The upper bound in the 
statement of the lemma follows. 

Only equivalent forms have been used to establish the upper bound in 
the statement of the lemma. Thus we can prove the lower bound by applying 
exactly the same arguments. 
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F Proof of Lemma [6] 

Let g : x i— > — 1/ (x fe / 2 ln 1+<: x) . The derivative of this function satisfies 
g'(x) = ^i-" 2 - 1 (ln-< 1+< » x ) + (1 + e)!-*/ 2 -^!,-'' 2 *'* i) 
1 W + t 



x fc/2+i ln 1+e a; V Inx 
Therefore 

* I < a '(x) < k + 1 1 if ~ > e 2(l+e) 

As a consequence, 



k + T K ' ~ x k / 2 ln 1+e x ~ k 
and 

2 5 (x) f 00 1 , 2 . . 



fc + 1 - y a u fc / 2 in 1+£ « - fc 

Eq. © follows directly from this latter inequality. 

G Proof of the upper bound of Theorem [I] 

Again, let us first consider the simple case of dimension 1. Let d > 1. In 
this context, whenever a < d 2 /(641nd), we get by Lemma 0] that 

Pr OT (a) = d} < 2 ■ exp (-2 ln(d)) < 4 • 

More generally, let us denote by z'o the dimension that yields the infinity 
norm of d (i.e., such that |dj | =|| d ||). By applying LemmaSl we get that 

if a ^ ei^M then 

Pr{X l (a) = d l ,--- ,X k (a) = d k } < Pr {X io (a) = d io } 

< 2/4, = 2/ || d || 2 . 

For any £ > 0, there exists do > such that if d{ > do for all % = 1, . . . , k, 
then we can apply Lemma separately for each dimension. If a >|| d || 2 
/(641n || d ||), then 



Vi€{l,...,fc}, Pr{Xi(a) = di} < (1 + C) 

" 7r • a 
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Thus, since, for a fixed a, the random variables Xi{a) are mutually indepen- 
dent, we get 

Pv{X 1 (a) = d 1 ,...,X k (a) = d k } < ((1 + 0. ../I^ 



7r / a k l 2 
As a consequence, 

/(d) = Y, Pr{X(a)=d}vr(a) + £ Pr (X(a) = d} tt(o) 

lldll 2 ^ l|d|| 2 



■(641n||d||) "-(64111111111) 

s + (a+01/lY E 



7T y o 1+ ( fc / 2 )ln 1+e (a)' 

(64 In II d ||) 

One can then complete the proof by using Eq. ^ with N = gj^^|| d || • 

H Proof of Theorem [2] 

Let s G Z fc be a source node, and t £ Z fc be a target node. Assume that 
the distance between s and t in the lattice Z fc is dist(s, t) = d, where dist 
denotes the t\ distance in Z fc . We compute the expected number of steps 
greedy routing takes before reducing the distance to the target by a factor 2. 
Let u = (ui, . . . ,u k ) G Z fc be the current node reached by greedy routing, 
and let 

B = {v G Z k : dist(v,t) < dist(u,t)/2}. 

The probability Pr(u — ► B) that u has its long-range link pointing to a node 
in B satisfies 

Pr{u -> B} = Y Pr { u -» v l- 

We prove a lower bound on this probability. Let 5 = dist(u,t). Let 

S = {x = (xi, . . . , Xk),Xi G { — 1, 0, +1} for i = 1, . . . , 

For c G Z fe and r > 0, let B(c,r) denotes the ball of radius r centered at c, 
that is 

B(c,r) = {v G Z k : dist(c,v) < r}, 
and, for x G S, define 

B - = B < t + I- SE>- 

We have 5 X CB = B(t,5/2) for any x G S*. Moreover, one can easily show 
that there exists x G S such that for any v = (v\, . . . , v^) G -B x an d any 
i 6 {1, ... j fc}, we have |uj — > 5/ (6/c). 

Pr{u -» 5} > V Pr{u -> v} > |B X | • min Pr{u -> v}. 

veBx 
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If 6 > 6kdo then \ui — Vi\ > do for all i, and, by Theorem [TJ we get that for 
any v G £ x , 

Pr{u -> v} > 



u — v || fe • ln 1+e || u — v || 
where || • || denotes the norm. Since || u — v ||< dist(u, v), we get that 

Pr{u -> v} > 



dist(u, v) fc • ln 1+e dist(u, v) ' 
Now, for any v £ i? x , we have v£B and thus dist(u, v) < 35/2. Therefore, 

Pr{ll — ► v) > „, , ^- rjr • 

1 ; " (f)fcin 1+t (f ) 

Since |S X | > O we get that 

Pr{ u ^ B} > Pr( u ^BJ > fi 

As a consequence, at every intermediate node u of greedy routing from s 
to t, if dist(u, t) > 6kdo then the probability of halving the distance to the 
target at the next step is at least ^( ln i+e Since all the long-range links 
resulting from m&f are mutually independent, we get that the expected 
number of steps for halving the distance to the target is 0(ln 1+e c?)). By 
linearity of the expectation, we get that the total expected number of steps 
for routing from s to a node at distance at most 6kdo from the target t is 
at most 

r io §2 ^i 

E [halving the distance 5 from 2 %+1 to 2*] 

i=[log 2 6fedol 
|"log 2 d] 

< Yl 0(m 1+e (2 i+1 )) 

i=[log 2 6kdo] 

< 0(ln 2+e d). 

Once at distance less than 6kdo to the target, greedy routing completes in 
O(l) steps, thus the total expected number of steps of greedy routing from 
s to t is 0(ln 2+e d). 
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